On the Need of Cross Validation for Discourse Relation Classification
Authors
Abstract
The task of implicit discourse relation classification has received increased attention in recent years, including two CoNLL shared tasks on the topic. Existing machine learning models for the task train on sections 2-21 of the PDTB and test on section 23, which includes a total of 761 implicit discourse relations. In this paper, we make a methodological point: the standard test set is too small to determine whether the inclusion of certain features constitutes a genuine improvement or whether one simply got lucky with some properties of the test set. We therefore argue that the community should adopt cross validation for the discourse relation classification task.
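To give a rough sense of why 761 test items may be too few, the sketch below computes the half-width of a normal-approximation 95% confidence interval for an accuracy measured on a test set of that size. The accuracy value of 0.40 and the function name `accuracy_margin` are assumptions chosen only for illustration; they do not come from the paper, and the paper's own analysis may use a different procedure.

```python
import math


def accuracy_margin(accuracy: float, n_items: int, z: float = 1.96) -> float:
    """Half-width of the normal-approximation confidence interval
    for an accuracy estimated on n_items test instances."""
    return z * math.sqrt(accuracy * (1.0 - accuracy) / n_items)


n_test = 761        # implicit relations in PDTB section 23 (from the abstract)
assumed_acc = 0.40  # hypothetical accuracy, for illustration only

margin = accuracy_margin(assumed_acc, n_test)
print(f"accuracy {assumed_acc:.2f} +/- {margin:.3f} on {n_test} items")
```

Under these assumptions the margin is roughly 3.5 percentage points, so a gain of one or two points from an added feature on this test set alone would sit inside the sampling noise. Evaluating on more data, as cross validation over the corpus sections does, shrinks that margin.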
Similar resources
Can neighboring relations help to anticipate upcoming discourse relations?
Discourse relations between text segments can be explicitly marked using discourse connectors (e.g., because), but far from all discourse relations are marked explicitly. However, people are usually able to infer the intended relation. What kinds of cues do people rely on? Could they in principle use structural cues from the preceding discourse to anticipate upcoming discourse relations? This...
Identifying the Contrast Relation in Persian Discourse Using Supervised Learning Methods
Discourse is the part of language that is used to communicate. A discourse relation recognition system can identify one or more relations between the textual units in a discourse. As in other languages, the contrast relation is one of the relations available in Persian discourse. Contrast relation recognition in discourse is useful for generation and perception of discourse, paraphrasing and ...
Cross-lingual Discourse Relation Analysis: A corpus study and a semi-supervised classification system
We present a cross-lingual discourse relation analysis based on a parallel corpus with discourse information available only for one language. First, we conduct a corpus study to explore differences in discourse organization between Chinese and English, including differences in information packaging, implicit/explicit discourse expression divergence, and discourse connective ambiguities. Second,...
Classification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
Methodology for Validation of Issuance of Mystical and Ethical Narrations (A Case Study and Discourse Analysis on the Methodology of the Book Sirr ul-asra’)
The Book “the Secret of Prophet Mohammad’s Midnight Journey to the Seven Heavens in Explanation of Al-Mi’raj Hadith” is written by Ayatollah Sa’adatparvar. Analyzing the discourse of a part of its introduction, his recognition method about this hadith has been investigated in this paper. The paper aims at investigating the particular discourse pattern of the author in analyzing the document of ...